Evaluating the Use of Prosodic Information in Speech Recognition and Understanding

نویسندگان

Mari Ostendorf

Patti Price

چکیده

The goal of this project is to investigate the use of different levels of prosodic information in speech recognition and understanding. In particular, the current focus of the work is the use of prosodic phrase boundary information in parsing. The research involves determining a representation of prosodic information suitable for use in a speech understanding system, developing reliable algorithms for detection of the prosodic cues in speech, investigating architectures for integrating prosodic cues in a parser, and evaluating the potential improvements of prosody in the context of the SRI Spoken Language System. This research is sponsored jointly by DARPA and NSF.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Prosodic elements to improve pronunciation in English language learners: A short report

The usefulness of teaching pronunciation in language instruction remains controversial. Though past research suggests that teachers can make little or no difference in improving their students’ pronunciation, current findings suggest that second language pronunciation can improve to be near native-like with the implementation of certain criteria such as the utilization of...

متن کامل

Study on Detection of Prosodic Phrase Boundaries in Spontaneous Speech

Prosodic information, which has the abilities of disambiguation, improving the parsing of the spoken language and predicting recognition errors, becomes more and more important in speech recognition and understanding, especially in spontaneous speech. In this paper, we investigate the detection of the phrase boundaries by prosodic features in the domain-specified Chinese spontaneous speech. The...

متن کامل

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...

متن کامل

A generalised model for utilising prosodic information in continuous speech recognition

Prosodic features in continuous speech provide cues which may be used to disambiguate syntactic ambiguities and to increase the accuracy of speech recognition/understanding systems. This paper presents a novel method using a multivariate statistical framework for producing a model of the relationship between prosodic and syntactic structures in continuous speech. The model can be used for Lingu...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1989

Evaluating the Use of Prosodic Information in Speech Recognition and Understanding

نویسندگان

چکیده

منابع مشابه

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Prosodic elements to improve pronunciation in English language learners: A short report

Study on Detection of Prosodic Phrase Boundaries in Spontaneous Speech

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

A generalised model for utilising prosodic information in continuous speech recognition

عنوان ژورنال:

اشتراک گذاری